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(57) Abstract 

A computet comprises a plurality of processor modules 
(UOo-1 IOnX each having at least first mid second I/O connection 
interfaces (126, 1 30 A, 130ft, 130C I30D), a processor (122) con- 
nected to those interfaces (126, 13 OA, 130B, 130C, 130DX and 
read-write memory (118) connected to the processor. The first I/O 
connection interface (126) of each processor module (1 IOq* 1 lOp) is 
connected to a common bus (138). The second I/O c onne c ti on in- 
terface (130 A, 130B, 130C, 130D) of each processor module 
(I lOft-l 10u) is connected to a switch (150A. 150B* 150C 150D), the 
switch (150 A, l SOB, 150C, 150D) being operative to connect the 
second I/O connection interface (130 A, 130B, 130C, 130D) of a se- 
lected processor module (HOq-110^) selectively to the second I/O 
coimectiou interface (130 A, 130B, 130C, 130D) of any other pro- 
cessor module (HOo-HOh). A controller (142) is connected to the 
bus (138) for receiving over the bus (138) data pertaining to a first 
processor module (UOp-UOpX which requires access to Informa- 
tion, and to a second processor module (IIO^HOm). from which 
the required information is available, and for controlling the switch 
(130 A, l$0B» 150C 150D) to allow the required information to be 
transmitted from the first processor module (1 10o-UOn) to the sec- 
ond processor module (I IOq-1 10^). 
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COMPUTING MACHINE WITH 
HYBRID COMMUNICATION ARCH ITECTORE 

Background of the invention 
This invention relates to a computing machine 
5 with hybrid communication architecture, 

A digital computer solves a problem by break- 
ing the problem down into multiple steps. A compu- 
ter with a single processor is able to execute one 
step at a time. It takes an inordinate time to 
10 solve a complex problem by use of such a sequential 
mode of operation* By operating multiple proces- 
sors in parallel, it is generally possible to 
reduce substantially the time required to *olve a 
problem. 

X5 If multiple processors are operated in 

parallel, it is necessary for the processors to 
share data. One technique for sharing data among 
multiple processors is for there to be a common, or 
global, memory to which all the processors have 

20 access on an equal footing* A problem with this 
technique arises from the fact that only one pro- 
cessor can access the memory at a given time, and 
therefore contention -problems limit the number of 
processors that can be accommodated* The number of 

25 processors can be increased somewhat by use of 

coherent caching or crossbar switching, but such 
techniques are costly and cumbersome. 

h second method of allowing sharing of data by 
multiple processors involves use of a parallel 

30 communication bus, A bus allows a great deal of 

flexibility in communication, including the ability 
to broadcast data from one processor to many or all 
of the others in a single operation. Each proces- 
sor is able to execute independently until the need 

35 
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to communicat* arises. However, when communication 
ie necessary, contention problems arise since only 
one processor can transmit on the bus at a time, 
gse of multiple buses can reduce problems due to 
contention, but multiple buses also reduce flexi- . 
bility and add greatly to cost and complexity. 

A third technique for allowing multiple pro- 
cessors to share data is provided by point-to-point 
communication links. Processors that have built-in 
links are commercially available, and therefore 
they are very easy to provide. Links offer 
virtually unlimited expansion possibilities, since 
the number of communication paths increases 
whenever a processor is added. However, links are 
15 the most difficult to use, since the physical 

interconnection pattern must match the pattern of 
communication required by the program that is being 
. executed. If processors are added to or removed 
from the system, a new pattern of link connections 
20 must be established, and the program must be 
rewritten, recompiled or r at the very loast, 
relinked to match. Broadcasting a message is 
difficult and time consuming, since the message 
must be copied from one processor to the next until 
25 it has reached all processors. Since two different 
programs will generally require two different 
physical interconnection patterns, it has not 
hitherto been possible to execute multiple programs 
simultaneously using links for communication 
3G between processors unless the programs are 
specifically designed to require the same 
communication patterns. 

The difficulty of matching the physical 
interconnection pattern with the. pattern of comrau- 
35 nieation required by the program is partially 
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The read operation is identical to in except 
that the matching tuple is not removed from the 
tuple space* The eval operation is a specialised 
form of out * Out creates a passive tuple, whereas 
5 eval creates an active tuple. 

If a processor performs an jLn operation, it is 
necessary to search tuple space for the matching 
tuple* It would be time consuming to examine each 
tuple in turn to determine whether it matches the 
10 template, and therefore the tuples are classified 
and a directory is created to facilitate the 
search. In a multi-processor computer, different 
portions of the directory are accessed by the 
different processors, and in order to complete an 
15 ^ii operation, potentially all the processors must 

examine their portions of the directory against the 
template in order to determine whether a matching 
tuple exists* 

The Linda system makes the programmer's job 
20 vastly easier; since he need not know the source or 

the destination of his data. The directory is 
automatically consulted and used to match a request 
for a tuple with a tuple that is available. When a 
required tuple is not available, the requester 
25 waits until such a tuple becomes available* The 

Linda system can be implemented on any of the 
above-mentioned parallel processing architectures, 
most efficiently with global memory, next most 
efficiently with one or more buses, and least 
30 efficiently with links. 

Maintenance of the Linda distributed directory 
is well suited to the bus, because it enables 
notification to be broadcast to potentially inter* 
ested processors as tuples becomes available. The 
35 notification messages are short, and therefore a 
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alleviated by adding a link crossbar switch, by 
which the configuration of the physical intercon- 
nection pattern can easily be changed at the start 
'. of a. program to match the communication pattern 
5 required by the program. However , use of a link 
crossbar switch in this manner does not ease the 
job of 4efining a suitable* connection pattern, nor 
does it facilitate simultaneous execution of two or 
more programs using the same set of processors. 
10 £>• Gelerxxter, Generative Communication in Linda, 

ACM Trans* Prog. Lang, and Sys., Vol* 7, »o. 1, pages 
80-112 r (19 8S) describes a software system called 
Linda. * Linda is based on the use of tuples. A tuple 
is a collection of related data. Elements of a tuple 
15 are fields holding actual values or formal*. There 
are two types of tuples, namely passive tuples and 
active tuples. A passive tuple is simply a collec- 
tion of data items , whereas an active tuple is a 
process which becomes a passive tuple. Tuples exist 
20 an abstract space called tuple space. The tuple 

space may exist over multiple processors. Four prin- 
cipal operations can be performed on tuple space. 
The out operation is an operation that creates a . 
tuple and places it in tuple space. The in operation 
25 is the reverse of out s it specifies a tuple that it 
desires, in the form of a template, and the computer 
matches the template against all the tuples existing 
in tuple space* if a matching tuple is found, it is 
removed from tuple space and is returned to the 
30 requesting process. When no tuple matches, the in 
operation blocks, and the requesting process is 
suspended until another process, through an out 
operation, creates a matching tuple. At, this point, 
the requesting process- continues. An out operation 
35 can never block. 
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great volume of them can be moved over the bus in a 
abort period. However, if a tuple, which might 
contain a large block of data, is moved over the 
bus, updates to the directory may be blocked for 
5 long periods, resulting in poor system performance* 
Using links to maintain the Linda directory 
is very inefficient, since broadcasting requires 
repeated passing of the message from processor to 
processor* However, moving data blocks from produ- 

10 cer to requester is an operation to which links are 
well suited. Since there are many links, many such 
transfers can be taking place simultaneously if the 
right link connections are made* 

Other publications that refer to Linda indlude 

15 D. Gelernter, Programming for Advanced Computing, 
Scientific American, October, 1987, pages 91-98; 
r. Bjornson, ft). Carriero, 0« Gelernter and J • 
Leichter, Linda, the Portable Parallel . 
(1968); N. Carriero and 0, Gelernter, Applications 

20 experience with Linda, (1988); N* 

Carriero, D. Gelernter and J. Leichter, Distributed 
Data Structures in Linda, Proc* ACM Symp. on Prin- 
ciples of Prog. Lang., pages 236*242 (19B6); D. 
Gelernter, M* carriero, S. Chandran and 5. Chang, 

23 Parallel programming in Linda, Ptoc. int. Conf* on 
parallel Processing, pages 255-263 (1985); 
N. Carriero and D. Gelernter, The S/Net** 
Linda Kernel, ACM Trans. Comp. Sys., Vol. 4, No. 2, 
pages 110-129 (1986); M. Carriero, implementation 

30 of Tuple Space Machines, Yale University, Dept. of 
Computer Science, Research Report YALEU/DCS/RR-567 
(1987); N« Carriero and D. Gelernter, Integrating 
Multiple Tuple Spaces, the Pile System and Process 
Management in a Linda-based operating System, Yale 

35 University, Dept. of Computer Science, Research 
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Report YALEU/DCS/RK-Tcehnical Memo (19&8); M. 
Factor, and D. Gelernter, The Parallel Process 
Lattice as an Organizing System for Realtime 
Knowledge Daemons , (1988)* 

5 

Summary of the Invention 
A preferred embodiment af the present inven- 
tion is a computer which comprises a plurality of 
processor modules, each having at least first and 

10 second I/O connection interfaces, a processor con- 
nected to those interfaces r and memory connected to 
the processor. The first I/O connection interface 
of each processor module is connected to a common 
bus* The second I/O connection interface of each 

15 processor module is connected to a switch, the 
• switch being operative to connect the second I/O 
connection interface of a selected processor module 
selectively to the second I/O connection interface 
of any other processor module* A controller is 

20 connected to the bus for receiving over the bus 

data pertaining to a first processor module, which 
requires access to information/ and to a second 
processor module , from which the required informa- 
tion is available, an£ for controlling the switch 

25 to allow the required information to be transmitted 
from the first processor module to the second pro- 
cessor module by way of the switch. 

in a computer embodying the present invent ion , 
the Linda directory can be updated by broadcasting 

30 a template over the bus, and* when a processor 
module that has access to a matching tuple is 
Identified, the tuple is transmitted from the pro- 
ducing processor to the requesting processor in a 
non-broadcast fashion, by way of the switch. During 

35 the transmission, the bus is available to process 
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other updates and arrange other connections, so 
that many transmissions can occur simultaneously. 

grief Description of Drawings 
5 For a better understanding of the invention, 

and to show how the same may be carried into 
effect, reference will now be made, by way of 
example, to the accompanying drawings in which; 

FIG. 1 is a block diagram of a first networked 
10 computer system embodying the present invention, 

FtG- 2 is a more detailed block diagram of one 
of the computer stations shown in FIG. 1, 

FIG. 3 is a simplified block diagram of a 
second networked computer system embodying the 
15 invention, and 

FIG. 4 is a simplified block diagram Of a 
stand-alone computer embodying the invention. 

Detailed Description 

20 The computer system illustrated in FIG. 1 

comprises a user terminal 10, several computer 
stations 12 and a disc drive station 14. The user 
terminal 10 comprises a processor 16 which is 
connected to various user utilities, such as a 

25 display card 18, a hard and/or floppy disc drive 

card 20, and a keyboard card 24, through its memory 
bus 26. The memory bus is also connected to local 
random access memory (PAM) 28 and local read only 
memory (ROM) 30* The processor 16 has four link 

30 interfaces 44. A second processor 42 is connected 

over its xemory bus to local RAM 48 and has four 
link interfaces 52. In the preferred embodiment of 
the present invention, each processor 16, 42 is an 
Inmos IMS T800 transputer. One link interface of 

33 the processor 16 is connected to a link interface 
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of the processor 42, and the other three link 
interfaces of each processor are connected to 
respective demultiplexed interfaces of a 6sl byte 
domain multiplexer/demultiplexer Jmux/demux) 60, 
5- which has a multiplexed interface connected to a 
fiber optic transceiver 64. The mux/demux 60 and 
the fiber optic transceiver 64 are used to transmit 
data end instructions between the processors 16 # 42 
and a fioer optic cable 6B r which connects the user 
10 * terminal 10 to one of the computer stations 12. 

Messages are transmitted over the cable 68 at a 
rate of about 100 Mb/s, 

The mux/damux 60 has a multiplexer channel and 
a demultiplexer channel. The multiplexer channel 
. 15 of the mux/demux comprises, for each demultiplexed 
interface, a serial to parallel converter which 
receives serial data over its link in words of 
eight, bits at a rate 10-20 Hb/s and, for each 
eight-bit serial word, generates an eight-bit 

20 parallel word and applies it to a parallel bus with 
a four-bit tag that designates the particular 
demultiplexed interface that provided the serial 
word. Thus, for each eight-bit word received at a 
demultiplexed interface, a twelve-bit word is 

25 applied to the parallel bus. The parallel bus is 
connected to a high speed parallel-to-serial con- 
verter, which reads : the twelve-bit parallel words 
in turn and generates an electrical signal composed 
of a succession of twelve-bit serial words. The 

30 electrical signal provided by the parallel-to- 
serial converter is applied to the fiber optic 
transceiver 64. 

The demylt iplexer channel of the mux/demux 60 
comprises a ser ial-to-oarallel converter which 

35 receives twelve-bit setial words from the fiber 
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optic transceiver 64 and generates twelve-bit 
parallel words which are applied to the parallel 
bus. Bach twelve-bit word comprises a four-bit tag 
and an eight-bit data word. Six parallel-to-serial 
5 converters, which are connected to the demulti- 
plexed interfaces respect ively, read the parallel 
bus* The parallel-to-serial converter that is to 
read a particular data word is designated on the 
basis of the four-bit tag associated with the data 

10 word. Thus, in the transmit mode the mux/demux 

receives up to six serial signals and interleaves 
them to provide a single serial output signal at 
its multiplexed interface. The output signal of 
the mux/demux is applied to the fiber optic trans- 

15 ceiver 64, which launches an optical signal, coded 
in accordance with the signal provided by the 
■nux/demux, into the fiber optic cable 68. In the 
receive mode, a coded optical signal is received 
over the fiber optic cable and the fiber optic 

20 transceiver generates a serial electrical signal in 
response thereto. The serial electrical signal is 
applied to the multiplexed interface of the 
mux/demux 60 and the mux/demux demultiplexes it 
into up to six signals which are provided at the 

25 demultiplexed interfaces respectively of the 
mux/demux . 

When data or commands are being input into the 
system, the processors 16, 42 manipulate the data 
or commands in accordance with programs stored in 

30 the ROM 30 and apply the data or commands in serial 
fashion over the links, the mux/demux 60 and the 
transceiver 64 to the fiber optic cable 6B. When 
data or commands are received by the transceiver 
over the fiber optic cable 68, the serial signal is 

35 demultiplexed into up to six serial signals which 
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are applied to the processors 16 r 42 over the 
links, and the processors apply the data or 
commands to the display card 13 or the disc drive 
card. 20* 

5 ■ As shown in FIG* 2, each computer station 12 com- 

prises several processor modules 110 r each of which 
.is composed of a processor 114 and a random access 
memory lid. In the preferred embodiment of the 
present invention* the processor 114 is an xnmos 

10 IMS T800 transputer, comprising. a CPU 122, an 

external parallel interface 126 and four link inter- 
faces 130A-120D» The processor 114 also includes , 
other components, taut these are not relevant to an 
understanding of the present invention and there- 

15 fore are not illustrated in the drawings* 

The CPU 122, memory 118, parallel interface 
126 and link interfaces 130A-133D of each processor 
114 communicate with each other over an internal 
32-ait parallel bus 134. The parallel interface 

20 126 is connected, to an external 32-bit parallel bus 
138, which is connected to the external parallel 
interface of each other processor module, a cross- 
bar switch operator 142 and at least one external 
communication module 144. - in the preferred embodi- 

25 ment of the present invention, the crossbar switch 
operator 142 is an Inmos IMS T414 transputer, which 
has essentially the same architecture as the IMS 
T800 transputer. . 

The link interfaces of the processor modules 

30 connected to a programmable crossbar switch 

14$, which is implemented by four mmos IMS C004 
programmable link switches 15OA-150D. Each link 
switch has. 32 data link connections. Respective 
data link connections of the link switch 150A, for 

35 example, are connected to the link interfaces 130A 



PAGE 17/36 1 RCVD AT 11/10/2004 5:01:18AM [Eastern Standard frnie] 1 SVR:USPTO^FXRF-1/0 1 DNIS:8729306* CSID:8064986673* DURATION (mm-ss):10-20 



10-11- *04 18:06 fg- WINSTON HSU, NO. 41526 8064986673 



T-164 P18/36 U- 



WO 89/09967 PCT/US89/01456 

11 

of the processors 114- Each link switch also has a 
configuration link connection 158 over which it 
receives a signal for establishing the manner in 
which the data link connections are connected by 
5 the switch. The configuration link connections 15B 

of the four link switches are connected to respec- 
tive link interfaces of the operator 142. 

The crossbar switch 146, the switch operator 142 
and the parallel bus 13B are all carried by a mother 
10 board having sixteen connection slots. Bach connec- 

tion slot can receive either a communications card, 
which carries a communication module 144, or a pro- 
cessor card, which carries two processor modules 110. 
Therefore, the maximum number of processor modules 
15 that can be accommodated (it being necessary to have 
at least one communication module) is thiry. The 
four programmable link switches provide an aggregate 
of 128 link connections, and the maximum of thirty 
processor modules occupy 120 of these data link 
20 connections. The other eight link connections are 
connected to the external communication module, for 
purposes which will be described below. if there is 
more than one communication module (and consequently 
fewer than thirty processor modules), each cotnmunica- 
25 tion module is connected to two link connections of 
each link switch. 

Referring again to TIG, 1, the disc drive 
station 14 comprises a single processor 160 which 
is connected over its memory bus 168 to a high 
30 speed disc drive card 162 and to local RAM and 

local ROM. The processor 160, which may be an 
inmos IMS T800 transputer, has four link interfaces 
which are connected to a 4:1 mux/demux 164, The 
mux/dejnux 164 of the disc drive station 14 is 
35 essentially the same as the mux/demux 60 of the 
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user terminal 10, and is connected to a fiber optic 
transceiver 166. The disc drive station provides 
high speed disc access without being burdened by 
the need to generate a display. 
5 The computer stations 12 shown in FIG* 1 each 

have two communication modules 144 r which enable 
the stations 12 to be connected in a linear 
arrangement between the user terminal and the disc 
drive station. Bach computer station therefore "can 

10 accommodate up to twenty-eight processor modules 

110. Alternatively , the user terminal 10, the disc 
drive station 14 and the computer stations 12 may 
be connected in a star arrangement, as shown in 
PIG. 3. In this arrangement, the computer stations 

15 12A-12D each need only one communication module, 

but the computer station 12E has six communication 
modules. A third possibility is for the computer 
stations to be organized as a hierarchical tree. 
Numerous, other arrangements, .employing different 

20 interconnection schemes among the user terminal, 

the disc drive station and the necessary processor 
nodules, are possible. 

Each computer station 12 executes an applica- 
tion by use of the Linda language. . Data are 

25 received by the computer station by way of an 

external communication module 144 and are stored in 
the external memories 118* The data stored in the 
memories 118 are associated as tuples, and each 
external memory 118 includes a directory portion 

30 containing information regarding the tuples that 

ar£ stored, in that memory* When the processor 114 
of a processor module 110 executes an out opera** 
tion, the tuple generated in the out operation is 
loaded into the processor module's external memory 

35 and the processor module's portion of the directory 



PAGE 19/36 * RCVD AT 1 1/10/2004 5:01:18 AM [Eastern Standard Time] * SVR:USPTO-EFXRF-1/0 * DNIS:8729306 * CSID:80W986673 * DURATION (mm-$$):10-20 



10-11- '04 18:06 tg- WINSTON HSU, NO. 41526 8064986673 



T-164 P20/36 U- 



WO 89/09967 PCT/US89/01456 

13 

is updated to reflect addition of this tuple* When 
a processor module performs an _in operation, it 
first examines its own portion of the directory 
against the template that defines the desired 
5 tuple* If no match is found, the requesting 

processor module may broadcast the template over 
the parallel bus 13d to other processor modules of 
the computer station. The receiving processor 
modules examine their respective portions of the 

10 directory* and the first processor module that 

finds a match places a signal on the bus to indi- 
cate that the other processor modules should cease 
the search. The requesting and producing processor 
modules then provide signals to the operator 142, 

15 and the operator responds by causing the switch 146 
to establish a connection between a link interface 
of the requesting processor module and the corres- 
ponding link interface of the producing processor 
module. The matching tuple is then transmitted 

20 from the producing processor module to the 

requesting processor module through the links and 
the crossbar switch 114. and does not occupy the 
bus 138. 

When an in or read operation takes place* the 
25 directory portion of the requesting processor 
module is updated to reflect the fact that the 
tuple space of that processor module now contains 
the specified tuple, similarly, when an in opera- 
tion (but not a read operation) is performed* the 
30 directory portion of the producing module is 

updated to reflect the fact that it no longer has 
the specified tuple in its tuple space. When an 
out or eval operation is performed* the directory 
portion of the module that executes the opera- 
35 tion is updated* It will therefore be seen that it 
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is not. necessary to burden the bus with messages 
pertaining to the contents of the tuple space of 
each processor module. 

Bus operation has four distinct cycles. A 
5 processor that requires access to the bus 138 in 
, " order to transmit a message asserts a bus request 
signal on a bus control line and an arbitration 
. cycle takes, place* If no other processor requires 
access to the bus at that time, the first-mentioned 

10 processor wins the arbitration by default. If one 
or more other processors requires aecess to the 
bus, distributed arbitration logic ensures fair 
arbitration among the processors that require 
access* When the arrbitration cycle is complete, a 

15 selection cycle takes place* The transmitting 

processor writes a single 32-bit word onto the bus. 
if the computer station has twenty-eight processor 
modules and two communication modules, twenty-eight 
.. bits of thie word define, on a one bit per module 

20 basis, a mask of the processor modules that are to 
receive the ensuing message* Two more bits deter- 
mine* whether the external communication modules are 
to receive the message. This is the selection 
operation. Thus, with a single bus cycle, the 

25 transmitting processor can select any one or more 
of the other processors to receive its message. 

Bach processor that is selected by the trans- 
mitting processor to receive its message receives an 
interrupt from its parallel interface. The interrupt 

SO forces the receiving processor into a receive message 
mode, in which each receiving processor reads the 
parallel bus. The transmitting processor receives a 
status bit that indicates whether a receiving processor 
is in the receive message mode, and does not transmit 
. 35 data until all the receiving processors are in the 
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receive message mode* When all the selected pro* 
cessors are in the receive message mode, the 
transmitting processor and the receiving processors 
are interlocked and the selection cycle is com- 
5 plete. A transmission cycle then occurs, in which 
the transmitting processor transmits its message 
over the bus in words of 32 bits. The transmitting 
processor holds its data word on the bus until it 
has received an acknowledgement bit from each 
10 receiving processor* The acknowledgement bit indi- 
cates that a receiving processor has read the data 
word from the bus. The transmitting processor then 
ends its write cycle and each receiving processor 
ends its read cycle. The transmitting processor 

15 then enters another write cycle, in which the next 
data word is transmitted. The first word placed on 
the bus during the transmission phase represents 
the number of words to be transmitted. The 
receiving procesors count the number of words 

20 actually transmitted, end when the number trans- 
mitted is equal to the number represented by the 
first word, they process the message. The trans- 
mitting processor enters a disconnect cycle in 
which it negates its bus request, and this allows 

25 arbitration to take place again. The processors 

that were previously selected are deselected. The 
transmission cycle is then complete, and the bus is 
available for another transmission. 

In the system described, transmissions are 

30 accomplished without use of FIFO buffers. How- 
ever, this is done at the expense of requiring the 
receiving processors to all operate synchronously 
during a transmission, and this might not be 
desirable. Therefore, each processor module could 

35 include a FZFO buffer for receiving and temporarily 
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holding data transmitted to that processor module 
over the bU8. la this fashion, the receiving 
processors are able to run independently of each 
other. Of course, a FIFO buffer has' limited capa- 
5 city, and if one or more of the buffers is filled 
during a transmission, it would be necessary to 
. fall back on the previously-described mode of 
transmitting data from a transmitting processor to 
the receiving processors* 

10 Each link is composed of two lines, allowing 

bi-directional communication between two proces- 
sors. The producing processor transmits words of 
eleven bits serially over the link using one line, 
the first hit of each word being a start bit and 

15 the lajst two bits being an ending* On receipt of 

the first bit of a word, the requesting processor 
transmits an acknowledgment to the producing pro- 
cessor over the other line of the link- The 
acknowledgment code is a unique two-bit code and is 

20 received by the producing processor before it com- 
pletes transmitting its word* In fact, the length 
of the serial word is such that the acknowledgment 
code can travel through three link switches and 
' still be received by -the producing processor, indl- 

25 eating that the next word can be sent, before the 
transmission of the first word is completed. 
Accordingly, the producing processor can send a 
second word immediately after the first word, 
without having to wait until after the end of the 

30 first word to receive an acknowledgement. 

By use of the hybrid communication architec- 
ture that has been described and illustrated, the 
computer station shown in FIG. 2 is able to recon- 
figure the pattern of link connections among its 

35 processors dynamically, in response to changes in 
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the communication pattern required by the program 
that ia being executed. Also, multiple programs 
can be run simultaneously, without regard to the 
communications patterns required by the program* 
5 respectively, since the pattern of link connec- 
tions is not fixed at the start of a program. 
The programmer need not consider the communica- 
tions pattern that will be required by a program 
that he is writing, since a suitable pattern of 

10 link connections is established automatically. 

The user terminal 10 can run the same programs 
as the computer stations 12. However, when it is 
used as the terminal for a network, the processors 
16 and 42 do not run applications but are concerned 

15 with graphics, creating the display, reading the 
keyboard and accessing the disc. The disc drive 
station 14 is particularly useful if disc access 
cannot be accomplished by the user terminal ID with 
sufficient speed when it is having to perform other 

20 functions. 

as noted previously, each computer station 12 
includes at least one communication module 144. As 
shown in FIG. 2, each communication module com- 
prises a processor 170, such as an inmos IMS T800 

23 transputer, having a parallel bus interface and 

four link interfaces. The parallel bus interface 
is connected to the parallel bus 138 and the four 
link interfaces are connected to respective link 
interfaces of a 12:1 mux/demux 172. The other 

30 eight link interfaces of the mux/demux are con- 
nected to eight link connections of the switch 146. 
Except for the number of link interfaces, and con- 
sequently the potential maximum speed of operation, 
the mux/demux 172 of the communication module 

35 operates in essentially the same way as the 
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roux/demux 60 of the user terminal 10. The communi- 
cation modules enable the computer stations to 
exchange tuple requests and tuples. 

When a requesting processor of a computer sta- 
5 tion 12 generates a mask that defines the modules 

that are to receive the template defining a desired 
...tuple, the mask will generally include the communi- 
cation module (s) of that station. The message 
that is transmitted by the requesting processor 

10 .. module includes the name of the tuple space that 

would contain the desired tuple, if it exists. The 
communication module stores the names of the tuple 
spaces that are associated with each of the other 
computer stations,, and if any of the other computer 

15 stations with which that communication module can 
communicate by way of its transceiver 174 is asso- 
^ elated with the named tuple space, the communica- 
tion module will transmit the template over the 
fiber optic cable. If the message is received by a 

20 computer station that includes a processor that is 
associated with the na^ed tuple space, the proces- 
sor of the receiving communication module directs 
the message over the parallel bus 138 to the 
appropriate processor module(e). If the message is 

25 received at one communication module of a proces- 

sor having a second communication module that is 
connected to a computer .station that includes, a 
processor that is associated with the named tuple 
space, the processor of the receiving communica- 

30 - tion module directs the message over the parallel 
bus 138 to. the other communication module for 
retransmission. Ultimately, the message reaches 
all computer stations having processors that are 
associated with the named tuple space. 
35 When a matching tuple is found in a computer 
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station other than the one that contains the 
requesting processor, the tuple is transmitted 
through the crossbar switch of the producing compu- 
ter station to a communication module of that sta- 
tion and is transmitted serially to a communication 
module of the requesting computer station. This 
might involve the tuple's passing through one or 
more intermediate computer stations, and being 
passed between communication modules of an interme- 
diate station through the crossbar switch of that 
station* When the tuple reaches the requesting 
computer station, it is transmitted to the 
requesting processor module by way of the crossbar 
switch. since, as noted previously, a requesting 
processor issues its acknowledgment code at the 
beginning of a data word, there is no significant 
performance penalty with respect to transmission of 
tuples from computer station to computer station in 
the described network arrangements* 
20 jt will be appreciated that tne present 

invention is not restricted to the particular 
embodiment that has been described and illustrated, 
and that variations may be made therein without 
departing from the scope of the invention as 
25 defined in the appended claims and equivalents 

thereof. For example, the invention is not 
restricted to use with any particular type of 
processor. It. is necessary only that the processor 
be sole to support two communication modes, one 
30 of which is suitable for transmission of data 

pertaining to information required by the processor 
and the other of which is suitable for transmission 
of tne information itself. The invention is not 
restricted to use with computers configured for 
35 connection in a network. If the FIG. 1 computer 
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was designed, for stand-alone use, it could 
accommodate up to thirty-two processor modules* 
Further, the invention is not restricted to use of 
a crossbar switch to interconnect the link 
5 interfaces of the processors. Five processor 

modules each having four link interfaces could each 
have one link interface hard wired to a link 
interface of each other processor module, as shown 
in FIG. 4. in this case, a producing processor is 
10 always able to transmit a tuple to a requesting 
processor by way of the link interfaces by which 
they are connected. 

15 
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Claims 

1. A computer which comprises a plurality of 
processor modules, each having at least first and 
second Z/O connection interfaces w a processor con- 
nected to those interfaces^ and read-write memory, 
5 a common bus to which the first I/O connection 

interface of each processor module is connected, 
switch means connected to the second I/O connection 
interface of each processor module and operative to 
connect the second I/O connection interface of a 

10 selected processor module selectively to the second 
2/0 connection interface of another processor 
module, and a switch operator connected to the bus 
for receiving over the bus data pertaining to a 
first processor module, which requires access to 

15 information, and to a second processor module, from 
which the required information is available, end 
for controlling the switch means to allow the 
required information to be transmitted from the 
first processor module to the second processor 

20 module by way of the switch means* 

2. A computer according to claim 1, wherein 
the common bus is a parallel bus and the first I/O 
connection interface of each processor module is a 
25 parallel connection interface. 

3* K computer according to claim 1, wherein 
the second I/O connection interface of each proces- 
sor module is a link interface* 

30 
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4 9 A computer according to claim l r further 
comprisins an external communication module which 
includes an external communication interface and is 
connected to the bus and to the switch means and is 
5 operative to transmit data between the external 

communication interface and the processor modules 
! by way of the switch means or the bus. 

5. A computer according to claim 4, wherein 
10 . the external communication interface is a fiber 
optic transceiver.. 

6m A computer system which comprises at least 
first and second' computer stations each comprising: 
15 Ca> a plurality of processor modules, 

each having at least first and second I/O connec- 
tion interfaces, a processor connected to those 
interfaces, and read-write memory; 

(b> a common bus to which the first I/O 
20 connection interface of each processor module is 
.. connected; 

(c) switch means having a plurality 
connection interfaces connected to the second l/o' 
connection interfaces of the plurality of processor 
25 * modules respectively and also having at least one 
additional connection interface, the switch means 
being operative to connect any one of its connec- 
tion interfaces to any of its other connection 
interfaces; 

30 ^ (d) a switch operator connected to the 

bus for receiving over the bus data pertaining to a 
fi«t processor module, which requires access to 
information, and to a second processor module, from 
which the required information is available, and 

35 for controlling the switch means to allow the 
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required information to be transmitted from the 
first processor module to the second processor 
module by way of the switch means; and 

(e) at least one external communication 
5 module which includes an external communication 
interface and is connected to the bus and to the 
switch means and is operative to transmit data 
between the external communication interface and 
the processor modules by way of the switch means or 
10 the bus* 

7. A computer system according to claim 6, 
wherein the external communication interface of a» 
external communication module of the first compote? 

15 station is connected to the external communication 
interface of an external communication module of 
the second computer station. 

8. K computer system according to claim 6, 
20 wherein the external communication interface is a 

fiber optic transceiver. 

9« A computer system according to claim 6* 
wherein the common bus of each computer is a 
25 parallel bus and the first I/O connection interface 
of each processor module is a parallel connection 
interface. 

10. A computer system according to claim 6* 
30 wherein the second I/O connection interface of each 

processor module is a link interface and the 
additional connection interface of the switch means 
of each computer is a link interface. 

35 
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11. A computer system according to claim 6, 
further comprising a utter terminal having display 
means, data input means, at least one processor 
connected to the display means and data input 

3 means and having at least one I/O connection 
interface, and external communication means 
' C * connected to the Z/O connection interface and 

having an external communication interface which is 
connected to an external communication interface 
lCt of at least one computer station for transmitting 
data between 'said one computer station and the 
processor of the- user terminal. 

12. a method, of operating a computer com- 
15 prising a plurality of processors each having at 

least first and second I/O connection interfaces 
and read-wtlte memory « tbe method comprising: 

(a) in the event that a first of said 
processors requires access to information that is 

20 . not present in its memory , transmitting a request 
for information to at least a second of said pro- 
cessors by way of the first I/O connection inter- 
faces of the first and second processors, and 

(b) in the event* that the required infor- 
25 matron is contained in the memory of the second 

processor, transmitting the required information 
from the second processor to the first processor by 
. way of the second I/O connection Interfaces of the 
first and second processors* 

30 
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13. a method of operating a compter, system 
which comprises at least first and second computer 
stations each comprising a plurality of processors 
each having at least first and second I/O connec- 
5 tion interfaces and read-write memory, and each 

computer station also comprising external communi- 
cation means, the method comprising i 

(a) in the event that one processor of the 
first computer station requires access to informa- 

10 tion that is not present in the memory of any of 
the processors of the first computer station, 
transmitting a request for information to at least 
the second computer station by way of the first X/O 
connection interface of said one processor and the 

15, external communication means of the first and 
second co&puter stations, and 

(b) in the event that the required infor- 
mation is contained in the memory of a processor of 
the second computer station, transmitting the 

20 Esquired information from that processor to said 

one processor by way of the second I/O connection 
interface of the second processor, the external 
communication means of the second and first compu- 
ter stations, and the second I/O connector inter- 

25 face of the first processor* 
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